Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
👁️ Attention Optimization
Flash Attention, Memory Efficient, Sparse Attention, Transformers
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
82103
posts in
548.8
ms
Attention
Retention
for
Continual
Learning with Vision Transformers
arxiv.org
·
2d
🧩
Attention Kernels
Sequential Attention: Making AI models
leaner
and faster without
sacrificing
accuracy
research.google
·
4d
·
Discuss:
Hacker News
,
r/LocalLLaMA
🧩
Attention Kernels
Regain
Your Focus
donotnotify.com
·
11h
🧩
Attention Kernels
The Key to State
Reduction
in Linear Attention: A
Rank-based
Perspective
arxiv.org
·
3d
🧩
Attention Kernels
Crafting the Eyes for Thinking Machines: Rewiring the
Retina
- The Anatomy of
ViTStruct
pub.towardsai.net
·
1d
🧩
Attention Kernels
Focal
Self-attention for Local-Global
Interactions
in Vision Transformers
dev.to
·
3d
·
Discuss:
DEV
🧩
Attention Kernels
Show HN: Model Training Memory
Simulator
czheo.github.io
·
13h
·
Discuss:
Hacker News
📊
Gradient Accumulation
A
Normalized
Gaussian
Wasserstein
Distance for Tiny Object Detection
paperium.net
·
9h
·
Discuss:
DEV
🧮
cuDNN
🥇Top AI
Papers
of the Week
nlp.elvissaravia.com
·
8h
⚡
ONNX Runtime
AI Sees And
Understands
Images Far More
Efficiently
With New Embedding Technique
quantumzeitgeist.com
·
2d
🧩
Attention Kernels
AI Search Engine Performance Optimization Systems
open.forem.com
·
10h
·
Discuss:
DEV
🤖
AI Coding Tools
Neural population
geometry
and optimal coding of tasks with shared
latent
structure
nature.com
·
2d
📊
Gradient Accumulation
Human-like Search for Modern
Applications
anvitra.ai
·
20h
·
Discuss:
Hacker News
🧩
Attention Kernels
Main
Content ||
Math
∩ Programming
jeremykun.com
·
42m
📉
Model Quantization
Learning Models with Uniform Performance via
Distributionally
RobustOptimization
dev.to
·
1d
·
Discuss:
DEV
📊
Gradient Accumulation
Pattern Mapping for ADHD &
Neurodivergent
Minds
unloop.so
·
8h
🧩
Attention Kernels
Performance
Tip
of the Week #7: Optimizing for application
productivity
abseil.io
·
1d
⚙️
Systems Programming
Habit
Detection For Home
Assistant
hackaday.com
·
5h
🧩
Attention Kernels
How I
squeezed
a
BERT
sentiment analyzer into 1GB RAM on a $5 VPS
mohammedeabdelaziz.github.io
·
1d
·
Discuss:
Hacker News
🏎️
TensorRT
Continual
learning and the post
monolith
AI era
baseten.co
·
2d
·
Discuss:
Hacker News
📊
Gradient Accumulation
Loading...
Loading more...
Page 2 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help